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WO 01/16339 PCT/CA00/00977 

USE OF ARAB INOGALACTAN PROTEIN FUSION CONSTRUCTS IN A METHOD OF EXPRESSING 
PROTEINS AND PEPTIDES IN PLANTS 

The present invention is in the field of plant molecular engineering. In 
particular, the invention provides a method to express proteins and peptides in plants 
5 using arabinogalactan fusion proteins and peptides. 

BACKGROUND OF THE INVENTION 

Humans have long looked to plants as a source of medicines, as well as a 
10 source of food. Although most medicinal compounds from plants are non- 
proteinaceous in nature, plants produce a variety of proteins which have 
immunophysiological functions when ingested (Arai, 1996 Biosci. Biotech Biochem. 
60:9). Some examples are the hypotensive effects of a zein peptide from maize and 
the lowering of blood cholesterol by a glycinin peptide from soybean (Arai, 1996 
15 Biosci. Biotech Biochem. 60:9). The development of transformation techniques has 
now extended the potential of plants to produce a much broader range of proteins with 
medicinal/physiological effects, including antibodies, antigens, antibiotics, and 
growth factors (Owen and Pen, 1996 Transgenic Plants). Since such proteins 
originate from organisms quite different from plants, they also confer a much wider 
20 range of biological activity to plants than is otherwise available. 

There are now many examples of "successful" expression of foreign genes in 
plants, depending on the purpose of expression. Several bacterial enzymes have been 
expressed at satisfactory levels with very little modification, other than regulatory 

25 sequences at the 5' and 3' ends. Well-known examples are the genes for the NPT-II, 
GUS, CAT, and PAT enzymes. The protein levels needed for an enzyme, however, 
may be considerably less than what is needed for non-enzymatic proteins. Early 
attempts to express the Bt gene, from Bacillus thuringiensis, in plants are probably 
more typical of this type of research. Initial constructs containing the naturally- 

30 occurring form of the gene, or truncated versions thereof, produced levels of the 
protein which could not be detected on Western blots, although some biological 
activity was evident from feeding trials (Vaeck et al 1987 Nature 328:33). It was only 
after extensive re-designing of the gene that adequate levels of the Bt protein were 
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achieved in plant tissue (Perlak et al 1991 PNAS 88:3324). There are many other 
examples (and probably many unreported) of expression levels of foreign proteins in 
plants which are very low, i.e. 0.001% of total protein or less (e.g. Higo et al 1993 
Biosci. Biotech. Biochem. 57:1477 and Gomez et al 1998 Virology 249:352). 

5 

Several factors leading to low expression levels of foreign proteins in plants 
have been postulated, including A/T content, codons atypical of plants, premature 
poly-A signals, mRNA destabilizing sequences and fortuitous intron-like sequences 
(Koziel et al 1996 Plant Mo!. Bio.l 32:393; Gallic 1996 In: Transgenic Plants (eds. 

10 Owen and Pen) pp. 49-74). Some support for these assertions has been provided by 
increased expression of re-engineered versions of these genes, as in the Bt example. 
However, there are now probably numerous examples which demonstrate that 
addressing only these factors may not be sufficient. Experiments with a swine viral 
antigen and a swine growth factor has shown that the complete re-synthesis of genes 

15 with these factors in mind may not result in increases in protein levels in transgenic 
plants which justify the effort (unpublished results). 

Several strategies have been adopted to address the problem of low levels of 
foreign protein expression in genetically transformed plants. These strategies include 

20 targeting the transgene protein to the endoplasmic reticulum (ER), apoplast, or 
chloroplast, and translational fusions. The role of the signal peptide is now well- 
established in directing protein synthesis in plants to the ER, as well as the roles of 
other intrinsic sequences in retaining proteins in the ER (H/KDEL) or targeting them 
to the vacuole (eg. in various seed storage proteins) (Conceicao and Raikhel 1996 In: 

25 Transgenic Plants (eds. Owen and Pen) pp. 75-98). The highest levels of Bt protein 
(3-5%) reported to date have been synthesized in the chloroplast following integration 
of the gene in the chloroplast genome (McBride et al 1995. Bio/Technol. 13:362). 
Wandelt et al (1992. Plant J. 2: 181) increased the amount of vicillin by 20 fold in 
alfalfa leaves by attaching a KDEL sequence to the C terminus of the coding region, 

30 thereby retaining the protein in the ER. In contrast, Verwoerd et al. (1995. Plant 
Physiol. 109:1 199) achieved the highest levels of phytase in tobacco leaves through 
secretion to the apoplast. 
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Expression of foreign proteins and peptides in plants has also been achieved 
by means of translational fusions to larger "carrier" proteins. A prominent example 
was expression of enkephalin, a 5-aa neuropeptide, as pan of a seed storage protein, 
reaching levels of 0,1% of total extractable protein (Van dekerckhove et al 1989. 
5 Bio/TechnoL). Another variation of this approach was the synthesis in plants of a 
malarial epitope as a part of the coat protein of the tobacco mosaic vims for use as a 
vaccine (Dalsgaard et al 1997 Nat. Biotech 15:248). Fusions to beta-glucuronidase, a 
bacterial reporter gene, and green fluorescent protein have been used to not only 
detect expression of the attached transgene protein, but also determine its location and 

10 distribution in tissues (e.g. Parmenter et al 1996 In: Transgenic Plants (eds. Owen 
and Pen) pp. 261-280). Finally, attachment to an endogenous plant protein was 
effective for the synthesis in plants of two mammalian proteins with pharmaceutical 
potential: the human cytokine, granulocyte-macrophage colony-stimulating factor, 
fused to the signal peptide and first 8 aa of a rice glutelin protein (Sardana et al. 1998 

15 In: Recombinant Proteins from Plants (eds. Cunningham and Porter) pp. 77-87); and 
the human anticoagulant peptide hirudin, produced as an appendage to the 3' end of 
the oleosin protein (Parmenter et al. 1996 In: Transgenic Plants (eds. Owen and Pen) 
pp .261-280). In this latter case the fusion to oleosin localized the hirudin to the oil 
bodies of canola, as the oleosin protein is attached to the membranes of those bodies, 

20 which facilitates isolation of the protein by flotation of the oil bodies after grinding of 
the seed in aqueous buffer. 

There is a vast range of mammalian proteins and peptides that one may wish 
to express in plants for a variety of reasons. Antimicrobial peptides and derivatives 

25 thereof could provide plants with some degree of resistance to bacteria and fungi 
(Hancock and Lehrer, Trends Biotechno). 16:82;). There is also a major interest in 
using plants as bioreactors to produce high- value proteins with pharmacological 
properties (e.g. Parmenter et al 1996 In: Transgenic Plants (eds. Owen and Pen) pp. 
261-280; Sardana et al. 1998 In: Recombinant Proteins from Plants (eds. Cunningham 

30 and Porter) pp. 77-87; Cramer et al 1996 In: Transgenic Plants (eds. Owen and Pen) 
pp.299-310). Another obstacle to the use of plants for such purposes involves not 
only the production of the proteins, but also the extraction and purification of proteins 
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and peptides from plant tissues. An option is to orally consume plant tissue in some 
form to obtain the benefit of the new protein produced therein. Presumably such 
proteins would be active primarily within the digestive tract (e.g. phytase, antibiotic 
peptides), affect mucosal tissues and development (e.g. epidermal growth factor), or 
5 stimulate an immune response, particularly a mucosal immune response, which would 
result in the secretion of protective IgA molecules into the intestinal tract. 

Despite these advances in plant molecular engineering, there remains many 
important proteins and peptides that have not proven amenable to high level 
10 expression in plants. There is therefore a continuing need for new methods for the 
expression of foreign proteins and peptides in plants at acceptable levels. There is 
also a need for a method for the expression of foreign proteins and peptide in plants 
that allows for the ready extraction and purification of the target protein or peptide. 

15 It is an object of the invention to overcome disadvantages of the prior art. 

The above object is met by the combination of features of the main claim, the 
sub-claims disclose further advantageous embodiments of the invention. 



20 
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SUMMARY OF THE INVENTION 

The present invention provides a method of expressing a foreign or 
heterologous protein or peptide in a cell. In particular, the invention is directed to a 
5 method of increasing foreign protein and peptide expression in a plant cell comprising 
the use of an arabinogalactan fusion product. 

The present invention pertains to a method of expressing a foreign protein or 
peptide in a cell comprising the steps of: 

10 (a) making a fusion construct comprising one or more regulatory elements in 
operative association with a chimeric gene, the chimeric gene comprising a 
first nucleic acid encoding an arabinogalactan protein or a fragment thereof 
and a second nucleic acid encoding a foreign protein or peptide to be 
expressed in the cell; and 

15 (b) introducing said fusion construct into a host cell so that the cell expresses the 
chimeric gene. 

Further, the invention provides a method as defined above wherein the first 
nucleic acid comprises a portion of a gene that encodes an arabinogalactan protein 
20 isolated from alfalfa pollen. In particular, the portion of a gene that encodes an 
arabinogalactan protein is that shown in SEQ ID NO:l (Figure 1). 

In another aspect of the invention, there is provided a method as defined above 
wherein the host cell is a plant cell. 



25 



The present invention also involves recombinant vectors comprising 
regulatory elements, a portion of a gene that, encodes an arabinogalactan protein and a 
DNA segment encoding the foreign protein or peptide to be expressed in the cell. 



The invention also extends to cells comprising any one of the recombinant 
vectors described above. In particular these cells are plant cells. 
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The invention also provides for a transgenic plant, or a transgenic seed, the 
genome of which has been transformed with any of the fusion constructs described 
above. 

5 This summary of the invention does not necessarily describe all necessary 

features of the invention but that the invention may also reside in a sub-combination 
of the described features. 

BRIEF DESCRIPTION OF THE DRAWINGS 

10 

These and other features of the invention will become more apparent from the 
following description in which reference is made to the appended drawings in which: 

Figure 1 shows the nucleotide and amino acid sequence of alfalfa pollen 
15 arabinogalactan (SEQ ID NO's: 1 and 2): 

Figure 2 is a graphical representation of the full length alfalfa pollen arabinogalactan 
protein. 

20 Figure 3 is the amino acid sequence of the fusion protein derived from truncated 
alfalfa pollen, arabinogalactan and the D-epitope of porcine transmissible 
gastroenteritis virus (TGEV; SEQ ID NO:3). 

Figure 4 is a graphical representation of the pBluescript® SK n+ vector. 

25 

Figure 5 is the nucleotide sequence of the truncated alfalfa pollen arabinogalactan as 
subcloned into pBluescript® SK (SEQ ID NO:4). The double underline 
represents the Haell cleavage site. The recognition sequence for Haell is 
rgcgc/y, and therefore this restriction enzyme cuts between nucleotides 710 
30 and 711. 
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Figure 6 shows the oligomers encoding the modified TGEV epitope. Annealing of 
the two 60-mers (sense, SEQ ED NO:5, and amisense, SEQ ED NO:6) resulted 
in "sticky ends" at both ;he 5' and 3' ends. 

5 Figure 7 is a graphical representation of the truncated alfalfa pollen arabinogaiactan 
and the synthetic D-epitope of TGEV. 

Figure 8 is a graphical representation of rhe pBISNl binary vector used for plant 
transformation. 

Figure 9 is a Western Blot showing expression of the truncated alfalfa pollen 
arabinogaiactan and the D-epitope of TGEV fusion protein in alfalfa 
(immunobloting with polyclonal antibody from chickens against the truncated 
alfalfa pollen arabinogaiactan). Lane A is the standard low range molecular 
weight marker; Lanes B-F are protein isolates from 5 transgenic plants; Lane I 
is an alfalfa tissue culture control. 

Figure 10 shows the geometric mean titre (GMT) of TGEV-specific antibodies from 
serum retrieved from mice immunized with P02-D in alfalfa (left hand 
column) or in tobacco (right hand column). Two different assays were done 
with each serum sample, and a different GMT was calculated for virus 
neutralization (VN) and for TGEV-recognition (ELISA). The error bars 
indicate standard deviation. 



10 



15 
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DETAILED DESCRIPTION OF THE INVENTION 

The present invention involves a method of expressing foreign or heterologous 
proteins and peptides in a cell. In particular, the invention concerns a method of 
5 increasing foreign protein and peptide expression in a plant cell comprising the use of 
arabinogalactan fusion protein and peptides. 

The following description is of a preferred embodiment by way of example 
only and without limitation to the combination of features necessary for carrying the 
10 invention into effect. 

Arabinogalactan proteins (AGPs) are found in flowering plants from every 
taxonomic group tested. They constitute a major class of proteins in the extracellular 
matrix between the pollen and pistil and are also widely distributed in other plant 

15 tissues (Chasan, R. Plant Cell 1994, 6:1519; Cheung, A.Y. Proc. Nat. Acad. Sci. USA, 
1995, 92:3077; Showalter, A.M. Plant Cell 1993, 5:9). AGPs and their corresponding 
cDNA's have been isolated from pistils and suspension cells (Chen, C.C. etal Proc. 
Natl. Acad. wSci. USA 1994, 6:1643; Du, H. et al Plant Cell 1994, 6:1843; Mau, S. et 
al Plant J, 1995, 8:269). AGPs have also been found in the plasma membrane of 

20 pollen vegetative cells (PennelL R.I. et al J. Cell. Biol. 1989, 108:1967) and sperm 
cells (Pennell, R.I. et al Plant Cell 1991, 3: 1317), and the cell wall of pollen tubes 
(Li, Y. et al Planta 1992, 188:532). Two homologous cDNAs encoding putative 
AGPs were isolated from Brassica napus (Gerster, J. et al Plani Physiol. 1996 
110:1231). 

25 

AGPs that are particularly useful in the method of this invention are those that 
are, non -membrane bound, non-enzymatic and non-structural. These characteristics 
make it easier for the AGP fusion protein or peptide to accumulate in the tissue with 
minimal adverse effects on cell function. However, AGPs that are membrane bound, 
30 enzymatic, or structural may also be used. The highly glycosylated nature of AGPs 
provides these molecules with the tendency to form aggregates. This "sticky" nature 
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AGPs adds to their potential as fusion proteins as this characteristic may improve 
isolation and purification of the desired foreign protein or peptide. Such aggregates 
may also be useful for protecting a foreign protein or peptide from degradation, both 
in plant tissue and in the digestive tract of animals. 

A pollen-derived DNA sequence was cloned as cDNA from an alfalfa pollen- 
specific library, and sequence analysis of the inferred protein structure indicated it to 
be an arabinogalactan protein (AGP) (Qiu et al 1997 Plant Sci. 124:41). 
Interestingly, O-glycosylation is universally found to increase the resistance of the 
underlying peptide to proteases (Jentoft 1990 TIBS 15:291), a property that could 
contribute to the utility of AGPs as fusion proteins and peptides. 

The alfalfa pollen AGP is a secreted protein, 259 amino acids long and it has 3 
domains (see Figure 2). Domain 1 extends from residue 1 to residue 20, and its 
sequence matches the consensus sequence of an amino-terminal signal peptide. The 
presence of such a region is consistent with the observed extracellular iocation of this 
AGP (unpublished data). Domain 2 is the central domain of the peptide, and it can be 
further divided into two subdomains; the first spans amino acids 21-130 and is 
enriched in aspartate and glutamate, while the second extends from residue 131-238 
and is very high in alanine, serine and proline. The O-glycosylation characteristic of 
AGPs is expected to occur within the region enriched for proline in domain 2 of this 
AGP. Domain 3 is a hydrophobic region from 239-259, and software analysis of the 
amino acid sequence suggests that it is a transmembrane helix (Qiu et ai, 1997 Plant 
Sci. 124:41). This third domain has been removed in the fusion construct made here. 
Many AGPs have, at the csrboxy-terminus, a hydrophobic domain. This domain 
represents an idea! target for removal and replacement with a sequence encoding a 
foreign protein or peptide of interest to make a chimeric gene to be expressed in a cell 
or host organism. However, it is to be understood that other fragments of the AGP 
protein, or other domains of AGP may be used or deleted as required to prepare a 
fusion protein as described herein. A fragment of AGP comprising at least 6 amino 
acids of the charged domain (i.e. nucleotides 1-27 of SEQ ID N02) is desired for the 
preparation of a fusion protein or peptide as described herein. For example, which are 
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not to be considered limiting in any manner, a fragment of AGP from about 1 to 
about 130 (of SEQ ID NO:2; including the signal peptide and charged domain, see 
Figure 2), or an AGP fragment comprising amino acids 1 to a position within the 
glycosylated domain (between amino acids 131-238; SEQ ID NO:2; including the 
5 signal peptide, charged domain and a portion of the glycosylated domain, see Figure 
2) may be used as described herein for the preparation of a fusion protein, however 
other lengths of the nucleotide seqquence disclosed in SEQ DI NO:2 may be used as 
needed . A fusion protein or peptide comprising all or a portion of the charged 
domain (nucleotides 1-130 of SEQ ID NO:2, or a fragment thereof) of AGP may be 
10 useful for the expression of cationic peptides, since the charged domain helps enhance 
the stability of the fusion protein. 

All AGPs characterized to date are secreted and therefore ultimately end up in 
association with the plant cell wall. If expressed within a plant, the secretion of the 

15 AGP fusion protein or peptide into the plant cell wall may protect the protein or 
peptide from rapid degradation in the stomach of animals or humans by trapping the 
protein or peptide in an "enteric coating" consisting of a matrix of cellulose and 
hemicellulose microfibrils, lignin and the like. The gradual break-down of the cell 
wall would result in an extended release of the protein or peptide into the intestinal 

20 lumen of the human or animal eating the plant, where it would be needed for some 
immunophysiological function. However, the AGP fusion construct of the present 
invention may also be expressed in other host cell expression systems, for example, 
bacterial, yeast, fungal, insect or mammalian. In some cases, it may be that a small 
peptide, fused to an AGP or a fragment thereof, must be released from the AGP in 

25 order to have biological activity. Such cleavage could be achieved by any means 
known to one of skill in the art., for example but not limited to proteolytic degradation, 
for example, the action of trypsin at a trypsin site inserted into the chimeric gene 
between the AGP and the peptide. The trypsin enzyme is very specific regarding its 
amino acid cleavage site (following an arginine or lysine) and is abundant in the 

30 intestinal tract of humans and livestock animals. It is preferred for a trypsin site to be 
added only to fusion construct for those proteins or peptides containing no other 
trypsin sites. 
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The particulate structure of plant cell wall fragments containing a foreign 
protein or peptide may also confer an immunological advantage to an antigen 
produced in plants for the purpose of an oral vaccine, delivered as plant tissue. Such 
particulate structures are considered important for the generation of a mucosal 
5 immune response, important for protection against enteric pathogens. These 

pathogens also present themselves as paniculate structures to intestinal tissues which 
sample contents of the lumen. Association with cell wall constituents, in a natural or 
chemically modified state, such as cellulose, may provide the added benefit of 
adjuvant properties, thereby enhancing the immunogenicity of antigens co-presented 
10 in plant tissue as oral vaccines. 

The present invention provides a method of expressing a foreign gene in a cell 
comprising the steps of: 

(a) making a fusion construct comprising one or more regulatory elements, a 

15 portion of a gene that encodes an arabinogalactan protein (a first nucleic acid) 

and a. DNA segment encoding a foreign protein or peptide to be expressed in 
the cell (a second nucleic acid); and 

(b) introducing said fusion construct into a host cell so that the cell expresses the 
fusion. 

20 

The term "regulatory elements" as used herein is meant to describe DNA 
sequences which control the initiation, speed and termination of transcription. This 
may include promoter and termination sequences as are known to one of skill in the 
art. 

25 

The term "promoter" as used herein refers to the nucleotide sequences at the 5' 
end of a coding region of DNA, or fragment thereof that contain all the signals 
essential for the initiation of transcription and for the regulation of the rate of 
transcription. The promoters used to exemplify this invention are constitutive 
30 promoters that are known to those skilled in the art. By "constitutive promoter" it is 
meant a promoter that directs the expression of a gene throughout the various parts of 
a host, for example a plant and continuously throughout its development. In plants, 
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examples of such promoters known in the art include, but are not limited to, the 
synthetic super promoter (Ni, et al Plant J 1995, 7:661, which is incorporated herein 
by reference) and the CaMV 35S promoter (Odell et a]., 1985, Nature, 313: 810-812, 
which is incorporated by reference). The preferred constitutive promoter is the 
5 synthetic super promoter. If tissue specific expression of a gene is desired, for 

example seed or leaf specific expression, then promoters specific to these tissues may 
also be employed. 

Furthermore, as would be known to those skilled in the art, inducible 

10 promoters may also be used to regulate the expression of the gene following induction 
of expression by providing the appropriate stimulus (inducer) for inducing expression. 
The use of inducible promoters may be required as constitutive synthesis of a foreign 
protein or peptide may, in some cases, be toxic or inhibit normal growth of the host, 
for example a plant, with the result that the only plants which regenerate following 

15 expression are those expressing low levels of the foreign protein or peptide. In the 
absence of an inducer, the DNA sequences or genes will not be transcribed. 
Typically, the protein factor that binds specifically to an inducible promoter to 
activate transcription is not present or present in an inactive form which is then 
directly or indirectly converted to the active from by the inducer or inducing 

20 treatment. The inducer may be a chemical agent such as a protein, metabolite, growth 
regulator, herbicide or phenolic compound, a physiological stress imposed directly by 
heat, cold, salt or toxic elements or indirectly through the action of a pathogen or 
disease agent such as a virus. A plant ceil containing an inducible promoter may be 
exposed to an inducer by externally applying the inducer to the cell or plant such as 

25 by spraying, watering, heating or similar methods. 

The chimeric gene construct of the present invention can further comprise a 3' 
untranslated region. ' A 3' untranslated region refers to that portion of a gene 
comprising a DNA segment that contains 2. polyadenylation signal and any other 
30 regulatory signals capable of effecting mRNA processing or gene expression. The 
polyadenylation signal is usually characterized by effecting the addition of 
polyadenylic acid tracks to the 3' end of the mRNA precursor. Polyadenylation 
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signals are commonly recognized by the presence of homology to the canonical form 
5* AATAAA-3' although variations are not uncommon. 

Examples of suitable 3 1 regions for use in plant cells, are the 3' transcribed 
5 non-translated regions containing a polyadenylation signal of Agrobacterium tumor 
inducing (Ti) plasmid genes, such as the nopaline synthase (Nos gene) and plant genes 
such as the soybean storage protein genes, and the small subunit of the ribulose-1, 5- 
bisphosphate carboxylase (ssRUBISCO) gene. The 3 ! untranslated region from the 
structural gene of the present, construct can therefore be used to construct chimeric 
10 genes for expression in plants. 

The chimeric gene construct of the present invention can also include further 
enhancers, either translation or transcription enhancers, as may be required. These 
enhancer regions are well known to persons skilled in the art, and can include the 

15 ATG initiation codon and adjacent sequences. The initiation codon must be in phase 
with the reading frame of the coding sequence to ensure translation of the entire 
sequence. The translation control signals and initiation codons can be from a variety 
of origins, both natural and synthetic. Translational initiation regions may be 
provided from the source of the transcriptional initiation region, or from the structural 

20 gene. The sequence can also be derived from the regulatory element selected to 

express the gene, and can be specifically modified so as to increase translation of the 
mRNA. 

To aid in identification of transformed host cells, the constructs of this 
25 invention may be further manipulated to include selectable markers. Useful selectable 
markers include enzymes which provide for resistance to an antibiotic such as 
gentamycin, hygromycin. kanamycin and the like, or resistance to herbicides such as 
glufosinate. Similarly, enzymes providing for production of a compound identifiable 
by colour change, such as GUS (^-glucuronidase), or luminescence, such as 
30 luciferase, axe useful. 
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The portion of a gene encoding an arabinogalactan protein (first nucleic acid) 
is meant to encompass any gene encoding an arabinogalactan protein, of a fragment 
thereof, that is capable of acting as a fusion protein for the expression of a foreign 
protein or peptide in a cell, in particular, a plant cell. Preferably, but not to be 
5 considered limiting in any manner, the arabinogalactan protein should have a non- 
enzymatic or non-structural function and be non-membrane bound. It is also desirable 
for the protein to have a hydrophobic: region at the carboxy-terminus. The carboxy 
terminus may be removed and substituted with the foreign protein or peptide (the 
second nucleic acid) to be expressed in the cell using conventional molecular biology 

10 techniques known to those skilled in the art.. It is to be understood, however, that 
other portions of the AGP may be removed to produce an AGP fusion protein. More 
preferably, which should not be considered limiting, the AGP is a secreted protein 
found in the pollen. Most preferably., the portion of a gene encoding an AGP is an 
alfalfa pollen AGP, in particular, comprising the nucleotide sequence shown in SEQ 

15 ID NO:l, and. the amino acid sequence shown in SEQ ID NO:2 (Figure 1). However, 
it is to be understood that fragments of the nucleotide and amino acid sequences 
shown in SEQ ID NO's 1 and 2 may be used as a first nucleic acid. For example, 
which is not to be considered limiting in any manner, a fragment of AGP comprising 
amino acids 1 to about amino acid 130, or amino acid 1 to about amino acid 237 (of 

20 SEQ ID NO:2) may be used as described herein, however, other fragments of the 
AGP protein may also be used. 

The DN A segment encoding a foreign protein or peptide to be expressed in the 
cell may be any gene or DNA segment encoding any protein or peptide desired to be 

25 produced in the cell, in particular a plant cell. The term "foreign protein or peptide", 
also referred to as a "second nucleic acid", is meant to encompass any protein or 
peptide that is not normally expressed in the target cell. Examples of proteins or 
peptides that might be the target of expression in plane cells, include, but are not 
limited to, proteins or peptide? of pharmaceutical interest, such as antibodies, 

30 antigens, antibiotics, growth factors, hormones, lymphokines and activators. Also 
included are protein or peptide:; which are produced in a plant to enhance an 
agronomic or industrial trait, for example, but not limited to, herbicide resistance and 
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stress tolerance. Particular proteins or peptides thai can be expressed in a plant cell 
using an arabinogaiactan fusion protein include anirna! viral antigens, for example, 
but not limited to, an antigenic portion of porcine transmissible gastroenteritis virus 
(TGEV). This animal viral antigen may be expressed in a plant cell as an AGP fusion 
5 product, for example, as a modified D-epitope of TGE V, as shown in Figure 6 (SEQ 
ID NO;s 5 and 6). 

The present invention therefore provides a DNA comprising one or more 
regulatory elements in operative association with a chimeric gene, the chimeric gene 
10 comprising a first nucleic acid encoding an arabinogaiactan protein or a fragment 
thereof, fused with a second nucleic acid encoding a foreign protein or peptide. 
Furthermore, this invention includes a vector comprising the DNA just described. 

The use of the fusion system of the present invention might be used to enhance 

15 the levels of foreign proteins or peptides produced at low levels in a host, for example 
a plant. Within plants, the 5' end of the AGP gene contains naturally-occurring 
signals for the initiation of transcription and translation, which are major limiting 
steps in foreign gene expression in plants. However, it is to be understood that other 
regulatory regions, including promoters and enhancers may be used as desired with 

20 the AGP gene of the present invention in order to optimize expression of the AGP 
fusion protein. Of much greater significance, however, is the probable necessity of 
such a system for the synthesis of small peptides which may range in size from, say 6 
amino acids to perhaps 30 amino acids. It is very unlikely that peptides such as these 
could be produced in several hosts, for example plants, or that they would be stable, 

25 unless fused to a larger, stable protein. Such peptides, when produced in their natural 
hosts, are synthesized as part of a larger precursor protein, which is subsequently 
cleaved to yield the much smaller, biologically-active peptide. An example is 
glucagon-like peptide-2 (GLP-2), a 33--amino acid peptide, produced in humans as 
part of a much larger protein from which it is cleaved in specific tissues at certain 

30 times. The nucleic and amino acid sequences for GLP-2 are known (Drucken D. 
Diabetes 1998 r 47:159-169). It is highly unlikely that satisfactory levels of such 
peptides could be produced in plants without a fusion system such that proposed here. 
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It should be noted that GLP-2 does not contain a trypsin site within its sequence, 
making it a good target for the incorporation of such a site between the AGP gene and 
the peptide gene of the fusion construct. Another example of a smaller peptide, ideal 
for this application is porcine or human epidermal growth factor (EGF) (REF). 

5 

It is believed that expression of proteins or peptides using the method of this 
invention may be produced within a range of host systems, and in the case where a 
plant is the host.this expression is independent of the species of the plant. All plants 
are believed to express arabinogalac tan-type proteins and therefore this strategy 
10 should be suitable for all crop species., including monocots and dicots. Particularly, 
the method is suitable for food and food crop plants. More particularly, the method is 
suitable for dicots such as alfalfa, clovers, soybean, pea and tobacco. 

Plant gene expression vectors can, and have been as described herein, 

15 constructed specifically to express arabinogalactan fusion proteins in plant cells. The 
construction of plant expression vectors must be adapted for use with plant 
transformation procedures for transferring those vectors into transgenic plants. 
However, it is to be understood that one of skill in the art may readily adapt the AGP 
fusion system of the present invention for the expression of fusion proteins of interest 

20 within any suitable host system, including bacterial, fungal, yeast, or anima) 

expression systems. By "transformation" it is meant the stable interspecific transfer of 
genetic information that is manifested phenotypically. For example, which is not to 
be considered limiting in any manner the constructs of the present invention can be 
introduced into plant cells using Ti plasmids.. Ri plasmids, plant vims vectors, direct 

25 DNA transformation, micro-injection, electroporation, etc. For reviews of such 
techniques see for example Miki and Iyer, Fundamentals of Gene Tran sfer in Plants . 
In Plant Metabolism, 2d Ed. DT. Dennis, DH Turpin, DD Lefebrve, DB Layzell (eds), 
Addison Wesly, Langmans Ltd. London, pp, 561-579 (1997), and more specifically, 
alfalfa may be transformed according 3s described in 

30 www.oac.uoguelph.ca/www/CRSC/embryo/proced.htm], Tobacco leaf discs may be 
transformed using a protocol described in Fisher*/ al (1995 Plant Mol. Biol. Rep, 
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13:278). The present invention further includes a suitable vector comprising the 
chimeric gene construct. 

5 The chimeric gene product expressed in a desired host may be feed to an 

animal or extracted and administered to an animal. For example which is not to be 
considered limiting in any manner, of the fusion product, or the foreign protein or 
peptide, as described herein, may be introduced into any desired plant, including 
forage plants, food crops, or other plants depending upon the need. Examples of such 
10 plants include, but not limited to, alfalfa, corn, barley, tobacco, and potato. In the 
experiments outlined below, alfalfa and tobacco have been used as test organisms for 
the expression of a fusion product, however it is to be understood that the constructs 
of the present invention may be introduced and expressed in any plant, other host 
organism, or cell expression system. 

15 

It is contemplated that if a plant is the host, then a transgenic plant comprising 
a heterologous fusion protein as described herein may be administered to an animal in 
a variety of ways depending upon the need and the situation. For example, if the 
protein is orally administered, the plant tissue may be harvested and directly feed to 

20 the animal, or the harvested tissue may be dried prior to feeding, or the animal may be 
permitted to graze on the plant with no prior harvest taking place. It is also considered 
within the scope of this invention for the harvested plant tissues to be provided as a 
food supplement within animal feed. If the plant tissue is being feed to an animal with 
little or not further processing it is preferred that the plant tissue being administered is 

25 edible. Furthermore, the protein obtained from the transgenic plant may be extracted 
prior to its use as a food supplement, in either a crude, partially purified, or purified 
form using method commonly known within the art. In this latter case, the protein 
may be produced in either edible or non-edible plants. 

30 Expression of a fusion protein of rhe present invention, for example but not 

limited to an AOP.D-epitope of TGEV fusion, within a plant host indicates that fusion 
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proteins produced and expressed as described herein are biologically active and are 
capable of eliciting an immune response within an animal to which they are 
administered (see Example 5). 

5 The above description is not intended to limit the claimed invention in any 

manner, furthermore, the discussed combination of features might not be absolutely 
necessary for the inventive solution. 

The following examples, while illustrating the embodiments of the invention, 
10 are not to be considered as limiting the scope of this invention in any manner. 

EXPERIMENTAL EXAMPLES 

EXAMPLE 1: CONSTRUCTION OF THE GENE ENCODING THE P02D 
15 FUSION PROTEIN 

A fusion protein comprising a truncated version of an alfalfa pollen protein, 
designated as P02, fused to a modified D-epitope from the S-protein of TGEV was 
prepared. P02 was truncated after alanine-237 (see Figure 1), and a 17 amino acid 
20 synthetic epitope was added to the carhoxy-terminus. The amino acid sequence of the 
complete fusion protein is shown in Figure 3 (SEQ ID NO:3). 

The initial step in creating the gene encoding this fusion protein involved the 
restriction digestion of P02 cDNA that had been subcloned into pBluescript SK II + 

25 at the EcoRI and Xnol sites (Stratagene, sec Figure 4 ). ?02 v/as first excised from 
the pBluescript vector by restriction digestion with Xbal (Life Technologies). The 
resulting fragment of approximately 820 base pairs was purified from a 1% agarose 
TAE gel using the Clontech PCRPure kit according to the instructions of the 
manufacturer. This P02 fragment was then cut at position 710 by the Haell 

30 restriction enzyme (New England Bioiabs) (see Figure 5; SEQ JD NO:4), and the 
desired 800 base pair fragment representing the truncated P02 was extracted from an 
agarose gel, once again using the PCRPure kit. 
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The 5' overhang (sequence GCGC) generated by the cutting of P02 with 
Haell was matched by a complementary strand in the DNA encoding the D-epitope. 
This double stranded DNA fragment was created by annealing two synthesized 60- 
mers (see Figure 6; SEQ ID NOs: 5 and 6). A mixture of 250 pmols of each oligomer 
5 was prepared in a 50 |Lil reaction volume with lx PGR buffer (lOx buffer for Taq DNA 
polymerase - Life Technologies). This reaction mix was heated at 90-94 "C for 5 
minutes and then slowly cooled to room temperature. The final double-stranded 
sequence contained a 5'HaeH 'sticky end' and a 3 "SacI 'sticky end*. The codons used 
in this sequence were those found to be preferred in alfalfa (Plant Molecular Biology 
10 Labfax, pp.37-40). 

The truncated P02 gene and the annealed oligomers encoding the D-epitope 
were ligated into the pBluescript vector iv. an antisense orientation between the Sad 
and Xbal sites. To achieve this, pBluescript underwent restriction digestion with Sad 

15 and Xbal (Life Technologies), and was retrieved from a 1% agarose TAE gel using 
the PCRPure kit. The purified cut vector, along with the P02 fragment and the 
synthetic epitope, were ligated with T4 DNA ligase using the procedure for cohesive- 
end ligation provided by the manufacturer (Life Technologies). Approximately 20 ng 
of pBluescript, 20 ng of truncated P02 and 4.5 ng of the synthetic DNA were used in 

20 a 20 |il volume, representing a roughly 1:4: 10 molar ratio of fragments. The expected 
ligation product is depicted in Figure 7. 

Following the ligation step, 80 jxl of calcium chloride buffer (50 mM CaCl 2 , 
10 mM Tris-HCl; j;H 8.0) was added to the ligation reaction. This final 100 ul 

25 volume was used to transform Escherichia coli strain DH5tt using the heat-shock 

tranformation protocol detailed in Current Protocols in Molecular Biology (Ausubel et 
al 9 editors). Transformants were selected on the basis of blue-white colour screening 
on LB plates containing ampicillin (50 ug/ml), X-gal (20 |i.g/m!) and IPTG (0.1 mM), 
and screened by solony FCR (procedure derived Lee and Cooper, 1995). The primers 

30 used included one specific for the 5' end of P02 (cgctggccatatgggtttgaaaaacfcc; SEQ 



WO 01/16339 IP W PCT/CAOO/00977 



20 

ID NO:7) and the reverse primer for pBluescript (Stratagene sequence - 
ggaaacagctatgaccatg SEQ ID N0:9). 

One colony was found to have a PGR product of the expected size, and the 
5 plasmid was extracted and analysed by restriction digest. Double digestion with Xbal 
and Sad yielded a fragment of the expected length. Sequencing of this using the T3 
primer indicated that the 3' end of the construct was as expected. This P02D gene 
was cut from pBluescript usinn Xbal and Sad and gel purified from a 1% agarose 
TAE gel (NucleoTrap kit by Clontech according to instructions provided). 

10 

EXAMPLE 2: INTRODUCTION OF P02I) INTO PLANT 
TRANSFORMATION VECTORS 

The bimry vector pBTSNl (see figure 8 and Ni et al 1 9?5 The Plant Journal, 
15 7:661) was used for plant transformation In order to introduce P02D into this vector 
it was first cut with the restriction enyzir.es Sad and Xbal and then the digestion 
mixture was run on a 1% agarose TAE gel The band of approximately 10 kb was 
excised using :l\t NucleoTrap kit. This fragment was ligated to the P02D fragment 
using T4 DNA ligase in a 20 \il volume. E. coli DH5a was transformed using 10 \i\ 
20 of this ligation nrx and th? rsdj/heat shock procedure described previously Colony 
PCR using the 6G-njer D-anti sense (SEQ ID NO:6; Figure 6) and z primer specific for 
the superpromotsr in pBISNl (aagaacgpatgcgegtg; SEQ ID NO:9) indicated that the 
ligation had been successful. 

25 EXAMPLE 3: TRANSFORMATION OF PLANTS 

In order to introduce the P02 gene into tobacco and alfalfa plants, the binary 
vector had to first be introduced into an Agrobacteriwn tumefatiens bacteria strain 
C58CIRif containing the heloer plasmid dMP90 (Koncz and Schell, 1986 Mol. Gen. 
30 Genet. 204:383). This was achieved using a protocol modified from Holsters et al. 
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(1978 Mol. Gen. Genet. 163:181). The presence of pBISNl in the A. tumefaciens was 
assayed using colony PCR (D-antisense and superpromoter primers). 

One colony containing the modified pBISNi vector with the P02D gene was 
5 used to transform alfalfa and tobacco plants. 

Alfalfa transformation was performed according to a procedure described on 
the following web page: , vvvv/.o acAio gue!p h.ca/w ww/CRSC/embr\ / o/proced.html . 

10 Tobacco leaf discs were transformed using z. protocol described in Fisher et al 

1995 Plant MoL Biol. Rep. 13:278. 

Once transgenic plants were recovered they were potted and introduced into 
the greenhouse. 

15 

EXAMPLE 4: ANALYSIS OF TRANSGENIC PLANTS 

The putatively transformed plants were screened using PCR with Taq DNA 
polymerase from Life Technologies (using manufacturer's instructions). Genomic 
20 DNA was extracted using a method adapted from Edwards, Johnstone and Thompson 
(Edwards et aL, 1991 Nucleic Acid Res. 19: 1349). Positive plants were further 
analysed for protein expression using Western blots with denatured protein. 

Protein from transformed alfalfa plants was extracted by grinding 50-100 mg 
25 of plant tissue in 2x volume of TBS buffer (50 mM Tris base, 1 50 raM NaCl; pH 7.5) 
in the presence of Ix. protease inhibitor (Boehringer Manheim). 6x SDS loading 
buffer (62.5 mM Tris-Cl, 2% SDS, 5% P-raercaptoethanoL, 10% glycerol) was added 
to a final concentration of 1.x. This mixture was incubated at 55-60 °C for 15-20 
minutes and vortexed thoroughly before being centrifuged (15000 rpm; 10 minutes; 
30 room temperature). The supernatant was transferred to a fresh tube and stored at -20 
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Prior to running the protein on an acrylamide gel the solution was boiled at 95 
"C for 4 minutes and centrifuged 4 minutes (15000 rpm; room temperature). 10-30 \il 
of extract was run at 150 V on an SDS-contaimng 10% poly acrylamide gel made 
according to a procedure found in Gei Electrophoresis of Proteins (Hames and 
5 Rickwood ed. 1981. pp 24-35). After approximately 1 hour, the protein was 

transferred to a polyvinylidene difluoride (PVDF) membrane. This was done at 4 # C 
in SDS transfer buffer (25 mM Tris-base, 190 mM glycine, 20% methanol, 0.1% 
SDS) at 30 V for overnight. A duplicate protein ge! was stained, with Coomassie 
brilliant blue and relative amounts of protein per lane estimated. 

10 

Blocking of the PVDF membrane was accomplished by incubating the 
membrane with 3% milk powder in TBS overnight at 4 *C. The primary antibody 
(polyclonal anti-P02 IgY from chicken: courtesy of A. Agyare-Tabbi) was prepared 
in a 1:2000 dilution with 1% milk powder in TBST (TBS with 0.5 ul/1 Twesn-20), 

15 and the blot was incubated in this for 90 minutes at room temperature. This was 
followed by 3-4 five minute washes in T3ST. The secondary antibody used was 
mouse anti-chicken IgG conjugated to alkaline phosphatase (Sigma, catalogue no. A- 
9171), this was added as a 1:10000 dilution in 1% milk powder in TBST. Four 10 
minute washes in TBST followed and the blot was developed with BCIP and NBT ( 

20 Alkaline Phosphatase Substrate Package -Lffe Technologies) according to the 
instructions of the manufacturer. A representative blot is included as figure 9. 

Similar experiments were performed with transgenic tobacco. Preliminary 
evidence suggests that P02 and derived fusions are expressed highly in leaf tissue of 
25 tobacco. 

EXAMPLE 5: ANIMAL STUDIES 
Jjvi^ctLor^ 



30 
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Groups of five BALB/c mice were intraperitoneaily (Lp.) injected once weekly 
for four weeks, with 1 mL/mouse of crude soluble protein extract from leaf tissue 
ground in PBS buffer (see above). Four separate mouse groups were used in this 
experiment, resulting in a total of 20 mice. Group 1 was injected with P02-D alfalfa 
5 extract, group 2 with P02-D tobacco, group 3 with control alfalfa, and group 4 with 
control tobacco. Blood was retrieved from mice using a tail bleeding procedure five 
days after the last injection. The blood was kept at room temperature for 1 hour until 
clotted and then centrifuged in a microcentrifuge at maximum speed for fi ve minutes. 
The serum was kept at -20°C until needed. 

10 

Serolo gy 

Virus centra lizat ion assay. The virus neutralization (VN) assay was 
performed using a technique described by Tuboly et ah (1994). Serial dilutions of 

15 mouse blood sera were made h 100 yL volumes of Dulbecco's Minimum Essential 
Medium (DMEM) and incubated with 100 p.f.u. of TGE virus (Purdue-115 strain) at 
37°C for 1 hour. These dilutions were transferred into a 96-weIl microtitre plate on 
which swine testicular (ST) cells had been grown to confluency. The plates were 
incubated in a humidified environment with 0.5% C(X and monitored for the 

20 formation of cytopathic effects (CPE). The first signs of CPE appeared between 18 
and 24 hours oo?t-inocu!aticn. The neutralizing antibody determined to be the highest 
dilution in the mixture applied to a well where CPE was not present. 

ELISA, The ELISA assay used was described by Cubero et ah (1993). It is a 
25 fixed cell assay, in which ST cells had again been fixed to a 96- well plate. Here, the 
cells were infected with Purdue-115 strain of TGEV but washed with PBS and fixed 
with acetone (80% v/v) before any cytopathic effect had occurred. After washing 
with PBST (PBS buffer wi* 0.05% Tween-20) the ptete was blocked for 15 minutes 
at 37°C in 1% milk powder in PBST. The plate was washed three times with PBST, 
30 and serial dilutions of antibody in PBST were added to each well, incubated for 1 

hour at room temperature, washed three rimes with PBST, and anti-mouse rabbit IgG- 
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horseradish peroxidase (HRP) conjugate wr.s added in a dilution required by the 
manufacturer (Kirkegaard and Perry Laboratories). After incubating the plate for 1 
hour at 37°C, the plate was washed with PBST and developed with 2,2'-azino-di-3- 
ethylbenzthiazolins-6-sulfonave (ABT3). Following a 30 minute incubation in a dark 
5 area at room temperature, the reaction was stopped and the OD403 measured. The 
TGEV-specific antibody litre of the serum was determined io be the highest dilution 
at which an OD measurement twice a? high as the control wells and at least 0.1 was 
made. 

10 Stati stical an aly sis 

Alfalfa and tobacco data were compared by performing a Student's t-test 
following the determination of a geometric mean titre (GMT) for serum retrieved 
from injected animals. The geometric titres (GT) used in calculating the mean were 

15 the log2 of the inverse of the detected serum titre (e.g. a titre of 1 : 16 had a GT of 4). 
Serum from animals injected with extract from control plants did not neutralize virus 
in the VN assay. A positive response in the ELISA was based on a OD405 twice as 
high as the negative control nnd at least 0. 1, and no further statistical analysis was 
performed in the comparison of the pooled control sample to the samples collected 

20 from P02-D vaccinated animals. 

Immune response to alfalfa extracts 

All the serum extracted from mice injected intraperitoneally with an extract 
25 obtained from transgenic alfalfa comprising PQ2-D showed detectable antibodies 
(Table 1). Two of the five mice in the P02-D group died of shock directly following 
the injection and no serum was retrieved. Serum from mice injected with control 
alfalfa extract (group 3) was pooled, and no antibodies capable of neutralizing TGEV 
were detected. The ELISA OD^ values for the control sera were used as a measure 
30 of the absence of TGEV-speoificity, and all positive dilutions were determined to 
have an OD twice as high as these numbers and at least 0.1. The virus-neutralizing 
activity (see abov 5 ;) of TGEV-specific raubodies was determined by an absence of 
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CPE on the microti ire plate, however after more extended times (2-3 days) cellular 
infection could be seen. 



10 



Table 1: Alfalfa-immunized (Group 1) mice 

TGEV-specific antibody tifre 

Mouse VN ELISA 

1. 1:16 ~ " " " 1:20 

2. 1:8 1:20 

3. 1:32 1:40 



Immune response to tobacco extracts 



Three of the four serum .samples from the mice exposed i.p. to tobacco plant 
15 extract showed antibodies detectable in VN and ELISA assays (Table 2). Again, the 
serum from mice injected with control tobacco (group 4) was pooled before the assay, 
and no TGEV-specific antibodies were detected. One rrx-uss fr;:m the P02-D 
injection group died of shock following plant extract administration, and is not 
represented here. Similarly to the VN assay described in the alfalfa experiment 
20 above, after prolonged incubation (>3 days), CPE could be observed. 



Table 2: Tobacco-immunized (Group 2) mice 

TGEV-specific antibody titre 

: "'V- Mouse ^ ^_ Y5L EUSA^ 

25 ; . l. r~ 7:4 " "~ ~T:10 

■ • 2. 0 0 

. 3. 1:2 1:10 



30 



WO 01/16339 




PCT/CAOO/00977 



26 

A graphical representation of this data is shown below (Figure 10), in which 
the geometric mean titres of the alfalfa and tobacco serum samples are compared. The 
difference in TGEV-specific antibody response to the plant extracts is significant, at 
P<0.05 for the VN data and P<0.02 for the ELISA data. 

5 

While the present application has been described with reference to what are 
presently considered to be the preferred examples, it is to be understood that the 
invention is not limited to the disclosed examples. To the contrary, the invention is 
intended to cover various modifications and equivalent arrangements included within 
10 the spirit and scope of the appended claims. 

All publications, patents and patent applications are herein incorporated by 
reference in their entirety to the same extent as if each individual publication, patent 
or patent application was specifically and individually indicated to be incorporated by 
15 reference in its entirety. 
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1. A method of expressing a foreign gene in a cell comprising the steps of: 

(a) making a fusion construct comprising one or more regulatory elements in 
operative association with a chimeric gene, said chimeric gene comprising a 
first nucleic acid encoding an arabinogalactan protein or a fragment thereof, 
and a second nucleic acid encoding a foreign protein or peptide to be 
expressed in the cell; and 

(b) introducing said fusion constnict into a host ceil so that said cell expresses 
said chimeric gene. 

2. The method according to claim 1 wherein said cell is a plant cell. 

3. The method according to claim 2, wherein in said step of making, said first 
nucleic acid encodes an arabinogalactan protein that has a non-structural and non- 
enzymatic function. 

4. The method according to claim 3, wherein said first nucleic acid encodes an 
arabinogalactan protein that has a hydrophobic tail at the car-boxy terminus. 

5. The method according to claim 4 wherein said first nucleic acid comprises the 
sequence shown in SEQ ID NO: I (Figure I). 
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6. The method according to claim 2, wherein said plant is selected from the 
group consisting of food and food crop plains. 

7. The method according to claim 6, wherein said plant is selected from the 
group consisting of alfalfa, soybean, pea and clovers. 

8. The method according to claim. 1, wherein in said step of making, said second 
nucleic acid encodes a protein or peptide of pharmaceutical interest. 

9. The method according to claim 8, wherein said protein or peptide is selected 
from the group consisting of an antibody, antigen, antibiotic, growth factor, hormone, 
lymphokine and activator. 

10. The method according to claim 2, wherein said protein or peptide is selected 
from the group consisting of a protein or peptide that enhances herbicide resistance of 
said plant and a protein or peptide tha*: enhances stress resistance of said plant. 

1 L The method according to claim 9, wherein said protein or peptide is a viral 
antigen. 

12. The method according to claim 11, wherein said viral antigen is a porcine 
transmissible gastroenteritis virus (TGEV) antigen. 
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13. The method according to claim 12, wherein said TGEV antigen comprises a 
modified D-epuope of porcine TGEV encoded by the nucleic acid sequence of SEQ 
ID NO:4, SEQ ID NO:5, or a combination thereof. 

14. The method according to claim 1, wherein in said step of making, said second 
nucleic acid encodes a peptide comprising about 3 to about 50 amino acids. 

15. The method according to claim 14, wherein said peptide is selected from 
human epidermal growth factor, porcine epidermal growth factor and human 
glucagon-like peptide. 

16. The method according to claim 2, wherein said fusion construct is introduced 
into said plant cell by Agrobacterium-mtdi&led transformation. 

17. The method according to claim 1, wherein in said step of making, said fusion 
construct further comprises a selectable marker. 

18. The method according to claim 1 wherein in said step of making, said 
regulatory regions comprises a promoter effective in a plant cell and a transcriptional 
terminator effective in a plant cell. 

20. The method according to claim 1, wherein in said step of making said fusion 
construct does not contain a trypsin cleavage site. 



WO 01/16339 



PCT/CA00/00977 



30 

21. The method according to claim 1 , wherein said fusion construct encodes a 
trypsin cleavage site located between said first and second nucleic acids. 

22. An isolated DNA molecule comprising one or more regulatory elements in 
operative association with a chimeric gene, said chimeric gene comprising a first 
nucleic acid encoding an arabinogalactan protein or a fragment thereof, fused with a 
second nucleic acid encoding a foreign protein or peptide. 

23. A vector comprising said isolated DNA molecule of claim 22. 

24. The vector according to claim 23 further comprising a DNA sequence 
encoding a trypsin cleavage sice located between said first and second nucleic acids. 

25. A cell transformed with said vector according to claim 23. 

26. The cell of claim 25, wherein said cell is a plant cell. 

27. A transformed plant comprising said vector of claim 23. 

28. A transgenic seed comprising said vector of claim 23. 

29. A method for the preparation of a peptide or protein for administration to an 
animal comprising; 
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30. 



i) transforming a plant with a vector as defined in claim 23 to produce a 
transformed plant; 

ii) growing and harvesting said transformed plant; and 

iv) obtaining plant tissue from said transformed plants for administration 
to an animal. 

The method of claim 29, wherein in the step of obtaining, the peptide or 
protein of interest is isolated from said transformed plant prior to 
administration. 



31. 



The method of claim 29, wherein in the step of obtaining, the plant tissue is 
orally administered to said animal. 
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Figure 6 
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SEQUENCE LISTING 

<110> Bailey, Andrea 
Erikson, Larry 

<120> Method of Expressing Proteins and Peptides in Plants 

<130> 08-884443US.1 

<140> 
<141> 

<150> US 60/151,147 
<151> 1999-08-27 

<160> 9 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 938 
<212> DNA 

<213> Alfalfa pollen arabinogalact:ar. 
<400> 1 

gaaagcaaga aaaeagaaca atgggtttga aaaacacctc tgttccgtgc ttggccatta 60 

tgttgtcttt ctcttcggcg attaattgtt t^acataac caaaattatg ggacaatacc 120 

cggaactctc cacgttcagc aaatacccaa ccgaaaccaa gttggctgac caaataaaca 180 

gtggtaaagc cgttaceatt cttgcccctg acaacaaagc ta\:tecttct ctttctggta 240 

aaccacttga cgccattaaa gccgttattg gaacecatgt catacctgaa tcttacgatg 300 

agaaaaaact ttttgatatc e.ttggaagcc atgcacaatc gcctacactt tcaactgcgc 360 

caggtttagc cgctaaaacc tacetaecgc ttattaatga aggtgaaatg gcatttagtt 420 

ctgctgttga aggctcaacc ttcgatgcca cacttgttca aagcactgag gctgaaccag 480 

gcgttgttga gattcrccag gcttcacagc caatcgttaa ggttggtgct tcagctccgg 540 

caacaccagc aactccagca acaccagcaa caccatcaaa accagcaaca cctgcagctg 600 

tgtcaacatc cagtgccggt gatgttgcaa ei^ccagcagc atcaccttct gtggtgattg 660 

ccgagtcacc taacagtgtt cctgagccac ctg&gagttt tggtgangca cctgctcctg 720 

ctcctagcgc ctcttctcgt gccacattcg gattcattgg tgctgttate gccttcgctt 780 

ccatttttgt ctctttgtaa scgttatata etc,: etttx-t agagagaaaa ttraaacaata 840 

ttgattggaa gtgataaatt aett&at'-tg tr.OwU$,ct.€,g g:,;..aaaatca afcgtaacacc 900 

gttgatttgt gcttauggga ttcarttgtt c-a«tit*ga 938 



<210> 2 
<211> 259 
<212> PRT 

<213> Alfalfa pollen arabior-galactaL. 
<400> 2 

Met Gly Leu Lys Asp Thr Se:: Val Leu Cyt Leu A5? Il.e Met Leu Ser 
* 5 1C 15 

Phe Ser Ser Ala lie Asr. Cy& Leu At-:? Jl* Zl r I.*, lie Met Gly Gir 
20 30 

Tyr Pro Glu Leu Ser Thr Phe Ser Lys ry:c Lsu Tkr Glu Thr Lys Leu 
35 £0 45 

Ala Asp Gin lie Asa Ser Gly Lys Ala Val Thr lis Leu Ala Leu A.sp 
50 £5 i'O 
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Asn Lys Ala He Ala Ser Leu Ser Gly Lys Fro Lei: Asp Ala H e Lys 
65 70 75 80 

Ala Val He Gly Thr His Val He Pro Glu Fhe Tvr Asp Giu Lys Lys 
85 90 95 

Leu Phe Asp He He Gly Ser His Ala Gin Leu Pro Thr L«u Ser Thr 
100 ICS no 

Ala Pro Gly Leu Ala Ala Lys He Tyr Val Ser Lev. He Asn Glu Gly 
115 120 125 

Glu Met Ala Phe Ser Ser Ala Val Glu Gly Ser Thr Phe Asp Ala Thr 
130 135 xco 

Leu Val Gin Ser Thr Glu Ala Glu Pro Gly Val Val Glu He Leu G^r 
145 150 1S5 160 

Val Ser Gin Pro He Val Lys Val Gly Ala Ser Ala Pro Ala Thr Pro 

1€5 *?0 ://5 

Ala Thr Pro Ala Thr Pro Ala Thr Pro Ser lys Pro Ala Thr Pro Ala 

180 185 150 

Ala Val Ser Thr Ser Ser Ala Gly Asp Val Ala Thr Pro Ala Ala Ser 

155 . 2C0 205 

Pro Ser Val Val lie Ala Glu Ser Pro Asn Ser Val Ala Glu Ser Pro 
210 r:;..5 210 

Glu Ser Phe Gly Asp Ala Pro Ala Pro Ala Pro Ser Ala Ser Ser Arg 
225 230 ?:? 240 

Ala Thr Phe Gly Phe He Gly Ala Val lie Ala -Phe Ala Ser He Phe 

245 ;.£C 255 

Val Ser Leu 



<210> 3 
<211> 254 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Secuerce: fcGP--t;-e?:.tope 
of TGEV fusion protain 

<400> 3 

Met Gly Leu Lys Asn Thr Ser Val Lev Lys Leu All He Met Le<«- Ser 
1 5 10 15 

Phe Ser Ser Ala He Asn Cys L&u Asp lis Thr Lys He Met Gly Gin 
20 25 30 

Tyr Pro Glu Leu Sei Thr Fhe Ser Lys Tyi Leu Thr Giu Thr Lvs Leu 
3 5 40 45 

Ala Asp Gin He Asn Scir G2} Lys Ala v Thu He Lev A.' a Le" Aso 



6C 
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Asn Lys Ala lie Ala Ser Leu Ser Gly Lys Pro Leu Asp Ala lie Lys 
65 70 75 80 

Ala Val He Gly Thr His Val He Pro Glu Phe Tyr Asp Glu Lys Lys 
85 90 95 

Leu Phe Asp He He Gly Ser His Ala Gin Leu Pro Thr Leu Ser Thr 
100 105 110 

Ala Pro Gly Leu Ala Ala Lys He Tyr Val Ser Leu lie Asn Glu Gly 
115 120 125 

Giu Met Ala Phe Ser Ser Ala Val Glu Gly Ser Thr Phe Asp Ala Thr 
130 13S 14: 

Leu Val Gin Ser Thr Glu Ala Glu Pro Gly Val Val Glu He' Leu Gin 
145 150 155 160 

Val Ser Gin Pro He Val Lys Val Gly Ala Ser Ala Pro Ala Thr Pro 

165 170 175 

Ala Thr Pro Ala Thr Pro Ala Thr Pro Ser Lys Pro Ala Thr Pro Ala 

180 165 ISO 

Ala Val Ser Thr Ser Ser Ala Giy Asp Val Ala Thr Pro Ala Ala Ser 

195 200 205 

Pro Ser Val Val He Ala Giu Ser Pro Asn Ser Val Ala Glu Ser Pro 

210 215 220 

Glu Ser Phe Gly Asp Ala Pro Ala Pro Ala Pre Ser Ala Ser Phe Phe 
225 210 235 240 

Ser Tyr Gly Glu He Gin Leu Ala Lys Asp Lys Val Asn Glu 

245 250 



<210> 4 
<211> 950 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Secu'cnce.: fcranacateci 
alfalfa pollen AGP 

<400> 4 

caatactacg tggaaagcaa gaa<Laae.gaa caatgggtct gaaaaaeacc tctgttctgt 60 

gettggecat tatgttgtct ttctcttegg cgattaattg tttggacata accaaaatta 120 

tgggacaata cccggaactc tccacgttca gcaaatacct aaccgaaacc aagttggctg 180 

accaaataaa cagtgguaaa gccgttacca t~cfctgccct tgacaacaaa getattgett 240 

ctctttctcg taaaccactt gacgcc&fcta aagccgttat tggaacccat gttatacctg 300 

aattttacga tgagaaaaaa ctttttgata tcattggaag ccatgcacaa ttgcctacac 360 

tttcaactgc gcc&ggutta geegctaaaa tetaegtate gcttattaat gaaggtgaaa 420 

tggcatttag ttctgctgtt gaaggctcaa ccttcgatgc cacacttgtt caaagcactg 480 

aggctgaacc aggcgttgtt gagattctcc aggtttcaca gccaattgtt aaggttggtg 540 

cttcagctcc ggcaacacca gcucJCteuug cc^^cagc &uci:eca.tca aaaccagcaa 600 

cacctgcagc tgtgccc-aca tccagtgccg gugatgttge aaeaccagca gcatcacctt 660 

ctgtggtgat tgecgagtea cctaacagtg ttgctg&gtc acctgagagt tttggtgatg 720 

cacctgctcc tgctcc^agc gccucttctc gtgecacatt eggattoatt ggtgctgtta 780 

ttgcctttgc ttccau.ttt gtctctt,~gt aagctccata tc.^tgx.&ztz ctagagagaa 840 
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aatgaaacaa tattg&ttgg aagtgataaa ttsattaact cgttgctgct gggaaaaaat 900 
caatgtaaca ccgttgattt gtgcttatgg gattcatttg 'cxcagattga 950 



<210> 5 
<211> 60 
<212> DNA 

<213> Artificial Sequencer 
<220> 

<223> Description of Artificial Sequence : sense ciigo 
encoding modified TGEV epitope 

<400> 5 

ctcatttttt tcetacggag aaattc&att ggcwaagcp.t aag&tUiatg' aataagagct 60 

<210> 6 
<211> 60 
<212> DNA 

<213> Artifici&l Sequence 
<220> 

<223> Description of Artificial Sequence ;<mti beitse cligo 
encoding modified TGEV epitope 

<400> 6 

cgcggagtaa aaaa-tetatg cctcl:tt-£.ag r^e.rts/Ett. cc t?t!:c.caa ttacttattc 60 

<210> 7 
<211> 28 
<212> DNA 

<213> Artificial Sequence. 
<220> 

<223> Description of Artificial Scorn ^C€.:prin cr IK 2 
<400> 7 

cgctggccat atgggcctga aaaacacc 28 

<210> 8 
<2U> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :oligo for 
Bluescript 

<400> 8 

ggaaacagct atgaccatg 19 

<210> 9 
<211> 18 
<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Description of Artificial Sequence :p3 1 SNI primer 
<400> 9 

aagaacggaa tgcgcgtg 
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